# High-performance inference

ICONNAI ICONN 1 GGUF
Other
Quantized version of ICONN-1, offering multiple quantization options to meet different performance and quality requirements
Large Language Model
I
bartowski
609
6
Nvidia AceReason Nemotron 1.1 7B GGUF
Other
This is a quantized version of the NVIDIA AceReason - Nemotron - 1.1 - 7B model, which optimizes the model's running efficiency on different hardware while maintaining certain performance and quality.
Large Language Model Supports Multiple Languages
N
bartowski
1,303
1
Deepseek Ai DeepSeek R1 0528 GGUF
MIT
DeepSeek-R1-0528 is a large language model that has been quantized to optimize its running efficiency on different hardware.
Large Language Model
D
bartowski
2,703
6
Seed Coder 8B Instruct GGUF
MIT
Seed-Coder-8B-Instruct is a powerful open-source code model with features such as model-centricity, transparency, and high performance, and it performs excellently in various coding tasks.
Large Language Model Transformers
S
unsloth
3,391
1
PARD Llama 3.2 1B
MIT
PARD is a high-performance speculative decoding method that can convert autoregressive draft models into parallel draft models at low cost, significantly accelerating the inference of large language models.
Large Language Model Transformers
P
amd
2,219
1
Instella 3B Stage1
Other
Instella is a series of 3-billion-parameter open-source language models developed by AMD, trained on AMD Instinct™ MI300X GPUs, outperforming other fully open-source models of the same scale.
Large Language Model Transformers
I
amd
397
12
Llama 3 Swallow 8B Instruct V0.1
A Japanese-optimized large language model built on Meta Llama 3, enhancing Japanese capabilities through continuous pre-training and improving instruction-following abilities through supervised fine-tuning.
Large Language Model Transformers Supports Multiple Languages
L
tokyotech-llm
13.88k
20
Command R
C4AI Command - R is a research version of a high-performance generative model with 35 billion parameters, optimized for various use cases such as inference, summarization, and question-answering.
Large Language Model
C
cortexso
748
2
Yolov8
YOLOv8 is the latest generation object detection model developed by Ultralytics, building on the success of previous YOLO versions with new features and improvements to further enhance performance and flexibility.
Object Detection
Y
Ultralytics
5,391
212
Yi 34B 200K
Apache-2.0
The Yi series of models are next-generation open-source large language models trained from scratch by 01.AI. They support bilingual (Chinese and English) and perform excellently in language understanding, common-sense reasoning, reading comprehension, etc.
Large Language Model Transformers
Y
01-ai
12.63k
317
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase